Parameters of Pseudo-Random Quantum Circuits 
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Pseudorandom circuits generate quantum states and unitary operators which are approximately 
distributed according to the unitarily invariant Haar measure. We explore how several design param- 
eters affect the efficiency of pseudo-random circuits, with the goal of identifying relevant trade-offs 
and optimizing convergence. The parameters we explore include the choice of single- and two-qubit 
gates, the topology of the underlying physical qubit architecture, the probabilistic application of 
two-qubit gates, as well as circuit size, initialization, and the effect of control constraints. Building 
on the equivalence between pseudo-random circuits and approximate t-designs, a Markov matrix 
approach is employed to analyze asymptotic convergence properties of pseudo-random second-order 
moments to a 2-design. Quantitative results on the convergence rate as a function of the circuit size 
are presented for qubit topologies with a sufficient degree of symmetry. Our results may be theo- 
retically and practically useful to optimize the efficiency of random state and operator generation. 
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I. INTRODUCTION 



Random pure states play a prominent role in quantum 
information processing. Random states are defined with 
respect to the unitarily invariant (so-called Fubini-Study) 
measure on the space of unit vectors in the Hilbcrt space 
of the system [l| . Not only do random states possess the 
remarkable feature of saturating the classical communi- 
cation capacity of a noisy quantum channel Q , they also 
arc an enabling resource for protocols including super- 
dense coding of quantum states Q, approximate quan- 
tum encryption, and quantum data hiding 0, . Ran- 
dom quantum states can also be used for unbiased sam- 
pling, and the characterization of both 'typical' bipar- 
tite and multipartite entanglement in random states has 
long been the focus of extensive investigation 0| , re- 
cent results including estimates of moments of the sub- 
system purity distribution Q, relationships to state 
localization properties , and exact expressions for the 
probability distribution of entanglement measures such 
as G-concurrence and purity [1^ . 

How docs one generate random quantum states? One 
way is to apply a random unitary transformation to ini- 
tial computational basis states. Similar to random states, 
random unitary operators are drawn uniformly from the 
unique unitarily invariant measure - the Haar measure 
on the unitary group U(A^), where N denotes the Hilbert 
space dimension, iV = 2" for a multipartite n-qubit sys- 
tem [l^. Random unitaries themselves are useful for a 
number of quantum protocols ranging from remote state 
preparation to efficient error characterization [l^, [l^ 
and selective process tomography Unfortunately, 
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implementing random unitaries as a sequence of one- and 
two-qubit gates on a quantum computer is inefficient: the 
required number of quantum gates grows quadratically 
with TV, that is, exponentially with the number of qubits, 
n. Therefore, researchers are left with the challenge of 
constructing suitable pseudo-random (PR) substitutes. 

A promising approach to efficiently implement an en- 
semble of unitaries so that the resulting moments approx- 
imate those induced by the Haar measure is provided by 
PR circuits, introduced in Ref. [isl . These circuits con- 
sist of an iterated set of single- and two-qubit gates where 
certain specifications are chosen at random. As the set 
of gates is iterated (using different single-qubit gates for 
each qubit and at each time step), the statistical prop- 
erties of the implemented unitary operators and the re- 
sulting output states compare more and more favorably 
to the properties of random unitaries and states. Addi- 
tional studies of PR circuits have focused on analyzing 
both analytically and numerically the convergence to the 
desired distribution [H, [l^, , identifying optimal two- 
qubit gates, as well as elucidating some aspects related to 
the influence of qubit topology [2ll |. and quantifying the 
ability to efficiently generate states with generic entan- 
glement [22[. PR algorithms have also been formulated 
for cluster-state quantum computation (QC) in Ref. [H, 
allowing an optimal single-gate distribution to be identi- 
fied. A yet different realization via local measurements 
on weighted graph states has been proposed in [gj] . 

Our goal in this paper is to quantitatively investigate 
a number of parameters that affect the convergence of 
PR circuits to the Haar distribution. After providing the 
necessary background on the mathematical framework 
employed to characterize PR behavior in Sec. H, we pro- 
ceed in Sec. HI to assess the influence of three main 
design parameters in a PR circuit on a fixed number of 
qubits: the choice of single- and two-qubit gate distribu- 
tions; the influence of different qubit topologies; and the 
effect of probabilistic versus deterministic two-qubit gate 
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application. Not surprisingly, these parameters are inter- 
twined, causing the optimal choice for any one parameter 
to depend on one or several other parameters in complex 
ways. The influence of circuit size is addressed in Sec. 
IV. Remarkably, explicit scaling predictions turn out to 
be possible based on simple analytical expressions which 
are consistent with existing numerical evidence and an- 
alytical results. Sees. V and VI are devoted to analyze 
the effect of limited (non-selective) control and of differ- 
ent initial states, respectively. In Sec. VII we make some 
final remarks and conclude. Additional considerations 
on optimizing cluster-state PR circuits are included in 
Appendix A, whereas Appendix B includes a proposed 
convergence improvement in the specific yet important 
case of a PR circuit implementing an approximate Clif- 
ford twirl. 



II. QUANTIFYING PSEUDO-RANDOMNESS 

We begin by discussing possible ways of quantifying 
the distance between the ensemble generated by PR cir- 
cuits and the Haar-distributed ensemble. It is impor- 
tant to realize that any probability distribution over n- 
qubit quantum states or unitary transformations requires 
a number of parameters exponentially growing with n to 
specify. Therefore, it is impractical to gauge how well 
an ensemble of quantum states or unitary transforma- 
tions resembles the uniform Haar ensemble based on a 
full characterization of the distribution. 



A. t-designs 

For several tasks which utilize random states and uni- 
tary transformations, the details of the full distribution 
are not relevant, in the sense that only statistical mo- 
ments up to a finite order of the ensemble from which 
the random states or unitaries are drawn need to coin- 
cide with Haar-induced moments. That is, it suffices that 
the relevant probability distribution be indistinguishable 
from the uniform Haar distribution as long finite 
number of moments is given. This is captured by the 
concept of a quantum t-design [1^. Formally, an ensem- 
ble of states, [p{a)da,\ip{a))], (or, respectively, unitary 
transformations, [p{a)da,U{a)]) is a state (unitary) t- 
design if for any polynomial / of order {t,t) of the state- 
vector components (or matrix elements), 

j p{a)fma)))da^ J f ma))) da, 

or, 

J p{a)f{U{a))da - j f{U{a))da, 

where the integrals are taken with respect to the invari- 
ant (Fubini-Study or Haar) measure, respectively. A {t,t) 



polynomial refers to a linear combination of terms con- 
sisting of products of up to t variables and their t complex 
conjugates. 

In order to assess how well a PR circuit approximates 
a t-design, an appropriate norm must be defined on the 
space of {t,t) polynomials describing moments of the 
pseudo random circuit. We note that the concept of an e- 
approximate t-design for quantum states was introduced 
in |25| as an ensemble of states [p{a)da, \4'{a))] satisfying 

(l-e)! /(|^(a)))da < j p{a) f ma)))da 

< + fma)))da, 

for all {t,t) polynomials /. This notion induces a norm 
on the space of all {t,t) polynomials (specifically the ^oo- 
norm), which can quantify the distance of a distribution 
approximating an exact t-design. In our study, we shall 
focus on asymptotic convergence rates of arbitrary (2,2) 
polynomials to their expected value under the Haar mea- 
sure as a function of PR circuit depth. Note that Ref. [2^ 
has previously shown that a large class of PR circuits are 
efficient approximate unitary 2-designs, with respect to 
the diamond norm. 

B. Markov Chain analysis 

The case t = 2 is especially relevant, since it includes 
the majority of known protocols which utilize random 
states. Notable examples include unbiased noise estima- 
tion [H, and the generation of states with typical 
entanglement [13]. It is thus important to determine the 
behavior of (2, 2) polynomials (second moments) of the 
state components under the action of PR circuits. As 
shown in [22l |. the evolution of the second moments can 
be mapped to an appropriately defined classical Markov 
chain. Thus, convergence properties of the PR circuit 
to a 2-design may be directly established by exploiting 
properties of the corresponding Markov chain. 

To obtain the mapping, the density operator describing 
the pure quantum state being evolved by the PR circuit 
is written in the Pauli basis: 

p = |^)(V^| =^c.P., 

where = crj'^ ® . . . ® cr^" is a tensor-product string 
of single-qubit identity and Pauli operators, specified by 
the collective index i/ S 3 = {0, cc, y, z}". Let PR(£) 
be the family of PR circuits of depth ^. We shall be 
interested in the evolution of the second order moments, 
{Epf{(£)(cj/^^c^^£)}, as a function of the depth, ^, of the 
PR circuit. It will be shown by construction that the 
moments {EpR(f) (c^ ^)} follow a discrete Markov chain 
on J. That is, the evolution of W.pTx.(i){c^ i) satisfies 

EpR(£+l)(c^/+l) = '^^'iiiu^VK{i){cl^i) ^^Mf^^clg, 



3 



where M = {M^^} is a Markov matrix. For the random 
circuits examined here, the remaining second moments 
{EpR(£) (ciy^^c^^f)}, with V ^ vanish for ^ > 0. 

The class of PR circuits we are interested in have the 
foUowing structure: In a single time step, local (single- 
qubit) gates are applied to each qubit in parallel, followed 
by commuting two-qubit gates belonging to the Clifford 
group between a specified set of neighboring qubits. Typ- 
ically, the two-qubit gates will be conditional phase gates 
CZ= |0)i(0|i (g) I2 + ® 0-2. Under a single-qubit 

gate, each non-trivial Pauli operator transforms as 

where i? = \xah\ € SO (3) depends on the applied rota- 
tion. The corresponding transformation matrix R acting 
on the space of local Pauli operators (including the iden- 
tity, 1) is obtained by averaging the squared coefficients 
over the parameters specifying the desired distribution of 
rotations. We examine only those local gate distributions 
that satisfy 

"^{i^ca^cb) = 0, Va, 6, c. 

This condition ensures that i? = 1 ® E(a;^j) is indeed 
a Markov matrix. Assuming that each local gate is se- 
lected independently from the same distribution (see Sec. 
V for a different setting), the transformation resulting 
from simultaneous single-qubit rotations within a PR it- 
eration is the n-fold tensor product of these single-qubit 
transformations, L = i?*^". Since each two-qubit gate is 
a member of the Clifford group, it simply acts (up to ir- 
relevant phases) as a permutation on the columns of L. 
Thus, the full 4"-dimensional transformation is a Markov 
matrix provided that R is. 

Once the mapping is established, basic properties of 
Markov chains may be used to analyze the evolution of 
the second moments of the PR circuit. Specifically, if the 
Markov chain is ergodic, then the corresponding PR cir- 
cuit converges to a 2-dcsign. A Markov chain is crgodic 
if it is irreducible and aperiodic. That is, every state 
must be reached from every other state and the recur- 
rence times to any state must not be multiples of some 
period k > \. A consequence of ergodicity is that for 
any initial distribution Vo over 3, there exists a unique 
asymptotic distribution, Voo, towards which the chain 
evolves. To illustrate this, we may expand the initial 
state in terms of the (linearly independent) eigenvectors 
{e„} of the Markov matrix, Vt = M'^v^ = J2n Ki'^nGn- 
For an ergodic chain, there is only one eigenvalue equal 
to 1, whereas all other eigenvalues have magnitude less 
than 1. Thus, the contribution to v-r from each eigenvec- 
tor (other than the ergodic eigenvector that is equal to 
1) decays to zero exponentially at a rate governed by the 
corresponding eigenvalue. For times r log(Afe/Ai), the 
contribution from the sub-dominant eigenvalue, Ai, will 
be the dominant non-ergodic contribution, leaving 

\\Vr - VooWoo ^ e"^'' , (1) 



where F = — In(Ai), and A = 1 — Ai is the spectral gap 
of the Markov chain. Since any (2,2) polynomial may be 
expressed in terms of the moments {EpR(£) (c^^fC^,^)} it 
follows that at sufficiently long times, r, the difference 
between the expected value of any (2, 2) polynomial of 
the state vector components over the t-dcsign distribu- 
tion and the Haar-induccd expected value obeys 

|/2-dcsig„(T)-/Haar(T)| ~e-r-. (2) 

As we shall see, certain entanglement measures which 
serve as useful test functions for PR circuits are express- 
ible as (2, 2) polynomials and will thus converge expo- 
nentially to their Haar-expected value at a rate of F. 

1. Markov chain reduction 

A Markov chain may be reduced by identifying a par- 
tition of the state space, 3, into subsets, 3u C 3, such 
that the coarse-grained probability distribution obtained 
by summing over each subset follows a reduced Markov 
chain Af '. A necessary and sufficient condition for reduc- 
ing the chain is that the sum of the transition probabil- 
ities from a member of a subset, fi £ 3u, to all members 
of any subset, G tJ^, is the same for each member /z, 
that is, J2fi ue3y ^^m'' = ^uv must be the same for all 
u € 8u and all subsets 3v It is important to note that 
each eigenvalue of M' is an eigenvalue of M. 

We employ a reduced representation by averaging over 
local X and Y Pauli operators. Since CZ gates are invari- 
ant under rotations along the z-axis, we may restrict our 
choice of single-qubit gate distributions to those which 
initially randomize states in the x-y plane. Now let P 
be a Pauli string containing at least one Xi or 1^, and 
let P' be any string obtained from P by permuting Xi 
with Yi. Since M randomizes X^ and Y,, M{P-P') = 0. 
This defines the kernel of M, which may be removed by 
defining new variables = Xi ±Yi. Chain states in- 
cluding S~ may be discarded, whereas transitions within 
3' = {0,z,^}" are described by M' . There is now only 
one parameter left to characterize the local gate distri- 
bution. Let c G [0, 1] parameterize the extent to which 
the z-axis is left invariant. We shall refer to c as the 
local gate parameter henceforth. The single-qubit gate 
contribution to M' is then fully described by: 

/I \ 

R{c) = c . 
Vo 1-c ^) 

Haar-distributed rotations in SU(2) correspond to c = 
1/3. So-called HZ gates, single-qubit Hadamard gates 
followed by a random rotation about the z-axis, corre- 
spond to c = 0. The latter were identified in [2^ as op- 
timal single-qubit gates when using two-qubit CZ gates. 

In some instances, it is possible to further reduce the 
Markov chain by taking into account qubit-pcrmutation 
symmetries in the construction of PR circuits (sec Sec. 
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IV). The corresponding Markov chain will admit a re- 
duced chain by forming equivalence classes of Pauli 
strings under symmetry operations. In the case of full 
permutation symmetry, equivalence classes may be la- 
beled by the number of X's and Z's in each Pauli string. 
Accordingly, the size of the reduced state space grows 
only quadratically in the number of qubits n, allowing 
scaling behavior of Markov matrix properties with cir- 
cuit size to be determined numerically. Note that while 
the reduced representation enabled by the initial ran- 
domization of X and y contains all of the non-zero eigen- 
values of the full Markov chain, reduced representations 
induced by qubit permutations discard eigenvalues asso- 
ciated with asymmetry with respect to the permutations. 
Thus, the behavior of test functions which do not possess 
the symmetry cannot be predicted if the initial state does 
not also share the symmetry which permits the reduction. 



C. Entanglement measures 

As remarked, entanglement properties of random pure 
states have been extensively studied. In particular, 
bipartite-entanglement across a partition is known to be 
nearly maximal for random states. As observed in [2^ . 
linear entropy, a measure of bipartite entanglement, may 
be expressed as a (2, 2) polynomial in the state compo- 
nents, allowing the rate at which bipartite entanglement 
approaches its expected Haar-value to be determined by 
Markov analysis. Both features - that the asymptotic 
value of the entanglement is nearly maximal, and that 
the simplest measure of the entanglement is a (2, 2) poly- 
nomial in the state vector components - are not limited 
to bipartite entanglement but also apply to a more gen- 
eral class of entanglement properties that are captured 
by the Generalized Entanglement (GE) approach [27| . 

The basic idea of GE is to abstract the notion of en- 
tanglement from a preferred subsystem decomposition 
and instead tie it to a distinguished set of observables to 
which one has access (in an appropriately defined sense) . 
Intuitively speaking, a pure quantum state \ip) is defined 
as 'generalized unentangled' relative to a distinguished 
set of observables, h, depending on whether or not the 
'reduced state' with respect to those observables is pure 
(extremal), and generalized entangled otherwise. That 
is, if one can determine which pure state a system is in 
by only making reference to the expectation values of ob- 
servables in h, then the reduced state is pure. Under ap- 
propriate mathematical assumptions on h [28l |. the sim- 
plest way to quantify GE is by taking the square length of 
the projection of the full pure state |^/')(V-'| onto h, yield- 
ing the so-called /i-purity. Formally, the latter is defined 
as P/i(|V')) = fiJ2i{''P\^iW^ 1 where k > is a normahza- 
tion constant and {Ai} is an orthonormal (with respect 
to the trace norm) basis of h. GE relative to h may be 
naturally quantified as GE/j = 1 — P/j. A simple represen- 
tative of this class of state functionals in the conventional 
multi-partite setting is the global entanglement measure 



of Meyer and Wallach [27|, \M \2£ 



Qm)^2^-J2Tr[p% 



(3) 



where pj is the reduced density matrix of qubit j. The 
expectation of Q over random pure states is 



ubit 7. 



N -2 
N + 1' 



(4) 



The difference between the average entanglement of PR 
algorithm output states and random state entanglement, 
1(0) ^ (Q-r)!; provides a distance indicator between PR 
and random states whose asymptotic behavior should be 
predicted by the Markov matrix analysis. The average 
Meyer- Wallach entanglement has been examined in pre- 
vious studies of PR circuits further motivating its 
consideration in the present context. 



D. State- vector element distribution 

The other method we use to assess the quality of our 
PR circuits is based on the distribution of state vector 
probability in the computational basis. Let c'j. denote the 
kth component of the ^th state randomly chosen from the 
set of all pure states, with ry = |c'j,p being the correspond- 
ing probability. The distribution of rj for random states 
is given by the Porter-Thomas (PT) distribution P. [3l]|: 

where N is the Hilbert space dimension. In the limit 
N ^ oo, and upon rescaling to unit mean, the distribu- 
tion becomes 

PpTiy) = e-y, y^Nrj. 

As our second randomness indicator, we shall examine 
the ^2-distance (simply denoted by || • II2 = | • | hence- 
forth) between the distribution of output probabilities at 
each PR iteration and the random (Porter-Thomas) dis- 
tribution. Note that because this test function depends 
on higher order moments of the PR-distribution, it is not 
a priori described by the above-described Markov chain 
analysis and thus can yield additional insight. 



III. OPTIMIZING PSEUDO-RANDOM 
CIRCUITS 

Any universal set of gates used in a PR circuit will 
eventually generate Haar-distributed states and opera- 
tors [ll]. Our goal here is to explore how various de- 
sign parameters affect PR circuits, in an effort to boost 
efficiency by optimizing time and qubit resources. By 
focusing on the network model of QC, we first character- 
ize the optimal choice of single- and two-qubit gates for 
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an open-chain topology as considered in the 'standard' 
PR architecture of Ref. [l^. The effect of different qubit 
topologies and probabilistic gate application is addressed 
next, whereas, for clarity, a similar analysis for cluster- 
state QC is sketched in Appendix A. 



A. Optimal gates for standard circuits 

PR circuits were introduced in the context of liquid- 
state nuclear magnetic resonance quantum information 
processing. In that context it was natural to look at an 
open chain of qubits and to use two-qubit Ising (ZZ) gates 
between nearest neighbor (NN) qubits, 



y 



ZZ 



exp 



as the relevant non-local resource. Subsequently, other 
two-qubit gate s have been suggested for use within PR 
algorithms [221 . A comprehensive study was recently un- 
dertaken [2l| to determine the two-qubit gate for which a 
PR circuit with random singlc-qubit gates will converge 
most quickly to the Haar expected value of bipartite en- 
tanglement as quantified by linear entropy. Recall that 
this is essentially equivalent to determining convergence 
rates of second order moments of the PR circuit. The 
protocol studied calls for a single two-qubit gate to be 
applied per time interval to pairs of qubits which are 
selected at random from all allowable neighboring pairs 
according to either a closed or open chain [3^ . It was 
determined that the so-called XY gate is the two-qubit 
gate which leads to the most efficient PR circuit of this 
type. The XY gate between qubits j and k is given by 



XY 



exp 



However, the use of only a single two-qubit gate per 
time interval introduces an unnecessary inefficiency since 
qubits not involved in the gate do nothing for that time 
interval and, in any experimental implementation, must 
be protected from error. 

We wish to examine to what extent the number of ap- 
plied gates can be traded for the number of time steps 
by applying gates in parallel. One cannot apply all NN 
XY gates during one time interval since they do not com- 
mute. Nevertheless, a more efficient PR circuit using XY 
gates can be obtained if, at each time interval, half the 
NN XY gates are applied. Fig. [1] compares the state 
vector clement distribution distance and average entan- 
glement distance as a function of iteration for a PR al- 
gorithm which uses conditional phase (CZ) gates (which 
are ZZ gates to within single-qubit z-rotations) between 
all NN qubits, and a PR algorithm which uses XY gates 
between half the NN pairs at each iteration (say, 1-2, 
3-4, etc. at odd iterations and 2-3, 4-5, etc. at even it- 
erations). In both cases, the single-qubit rotations are 
drawn randomly from SU(2). It takes significantly more 
time for the CZ-based PR algorithm to be within 10^"' 
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FIG. 1: (Color online) Z2-distance between PR distribution 
of squared moduli of components in the computational basis 
and the Porter- Thomas distribution (Inset: Distance of aver- 
age global entanglement from random value) for CZ (Q) and 
XY ( X ) two-qubit gates using single-qubit random rotations, 
and CZ gates using single-qubit HZ gates (□) (8 qubits, 100 
implementations, all computational basis states). When us- 
ing random single-qubit gates, both indicators come within 
lO""* much more quickly for XY two-qubit gates than for CZ 
gates. Modifying the CZ-based algorithm to use single-qubit 
HZ gates boosts the convergence rate so much that it even out- 
performs the XY-based algorithm with random single-qubit 
rotations. 



of typical entanglement values and PT distribution than 
the XY-based PR algorithm, despite the fact that all NN 
CZ gates are applied at every iteration. 

In all work mentioned above, it was assumed that 
single-qubit rotations should be random with respect to 
the Haar measure on SU(2). Based on studies of cluster- 
state PR algorithms, however, we have recently shown 
[2^ that when using two-qubit CZ gates a restricted set 
of single-qubit rotations, HZ gates, allows for faster con- 
vergence with respect to test functions which are second- 
order polynomials. This is confirmed through the Markov 
analysis reported in Fig. [2] which shows that, for suffi- 
ciently large n, the optimal singlc-qubit gates to use in 
conjunction with the CZ gate is indeed at c = 0, the HZ 
gate. Using these restricted random gates we see in Fig. 
[T] that the CZ-based PR circuit converges much more 
quickly than the CZ-based circuit with random SU(2) 
single-qubit gates. 

What single-qubit gate distribution is optimal for use 
with XY gates? From the inset of Fig. [21 we see that 
the answer depends on the qubit topology. For qubits on 
an open chain, as in standard PR circuits, the optimal 
single-qubit gate is at c ~ .5. For a closed-chain topol- 
ogy, SU(2) random singlc-qubit rotations appear to be 
optimal. 
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B. Effect of different qubit topologies 

1. Modeling strategies and preliminaries 

As mentioned, a standard PR circuit applies two- 
qubit gates between NN qubits on an open chain. Recent 
studies of PR algorithms have implemented two-qubit 
gates between NN qubits on a closed chain {i.e., sub- 
ject to periodic boundary conditions) plj . or between 
randomly chosen qubit pairs [13, HI]. This alters the 
convergence rate of the algorithm by effectively changing 
the qubit topology. In this section, we explore a number 
of representative topologies and the convergence rate of 
the implemented circuits as a function of different single- 
qubit gate distributions. 

Specifically, we investigate the following alternate 
topologies which employ two-qubit CZ gates: (i) NN 
qubits on a closed chain; (ii) a star formation where each 
qubit is connected to a central one; (iii) an all-to-all (AA) 
topology where CZ gates are performed between every 
possible pairing of qubits (note that all such gates com- 
mute and can thus be performed simultaneously). Our 
strategy is to rely on Markov chain analysis to explore 
how different single-qubit gate distributions affect the 
convergence rate for different topologies, thereby iden- 
tifying the optimal single-qubit gate for each topology. 
For PR circuits employing XY gates, open- and closed- 
chain NN topologies are contrasted, as a function of c. 

The circuits studied in this subsection aim at maximiz- 
ing parallelism, by applying as many CZ gates as possi- 
ble per iteration. In this case, the closed-chain topology 
is not ergodic over the entire state space. This can be 
seen by noticing that each qubit undergoes exactly two 
CZ gates per iteration (one with each NN). Under the 
action of pairs of CZ gates, the number of non-identity 
single Pauli operators in the Pauli string describing the 
state will remain even or odd. Within each of these fixed- 
parity subspaces, the closed-chain topology is ergodic and 
the PR circuit will induce exponential convergence to the 
corresponding ergodic state, with a rate calculated from 
the gap of the Markov matrix. The asymptotic entangle- 
ment for an n-qubit closed chain is also affected by the 
presence of the two distinct subspaces. Specifically, the 
total probability that the state of the Markov chain is in 
each subspace remains constant and is determined by the 
projection of the initial state into each subspace. Under 
the action of the Markov chain, whatever total proba- 
bility is found in each subspace is uniformly distributed 
within each subspace, allowing for the asymptotic en- 
tanglement to be calculated. For initial computational 
basis states, the resulting closed-chain (cc) global entan- 
glement is given by: 



En 
k o 



dd V*;; 



Z^fc odd \k) 



(5) 



For n = 8, Qcc = .988235, whereas the expected global 
entanglement value for the Haar distribution (from Eq. 



(|4])) is given by Qn = .988327. Therefore, when we com- 
pare the size of the gap (hence the convergence rate) 
for the different topologies it is important to remember 
that the closed chain is converging to a slightly different 
asymptotic state than the other topologies. 

Note that the same conservation of even or odd number 
of non-identity Pauli operators occurs in the AA topol- 
ogy for an odd number of qubits. This is because each 
qubit is involved in an even number of CZ gates. In this 
case, however, initial computational basis states evenly 
populate the two subspaces, therefore the final state is 
the ergodic state. 



2. Numerical results 

Fig. [2]summarizes the behavior of the Markov spectral 
gap. A, for each topology as a function of c. The opti- 
mal single-qubit gate is dependent both on the topology 
and number of qubits. A certain single-qubit rotation 
may be exceedingly good for a certain topology, but ex- 
tremely poor for another topology. Increasing the num- 
ber of qubits from n = 6 to 8 affects the convergence of 
different topologies in different ways: For certain topolo- 
gies, this increase in the number of qubits causes the gap 
to decrease, while for other topologies the gap increases 
for certain single-qubit rotations but decreases for other 
single-qubit rotations. 

We first address the effect of topology on PR algo- 
rithms using random single-qubit rotations, c = 1/3. The 
Markov analysis for 8 qubits suggests that the AA topol- 
ogy should converge the fastest, followed by the closed- 
chain topology (which, as mentioned above, converges to 
a different state) . The fast convergence of the AA topol- 
ogy is not surprising given that this topology allows for 
simultaneous interaction between all qubit pairs. The 
open chain has the slowest convergence rate. The star 
topology is slower than the closed chain, despite the fact 
that each qubit is only two couplings away from every 
other qubit. The need to traverse the central qubit is 
likely to be the bottleneck to the spread of entanglement. 
This conjecture is supported by our later explorations 
of probabilistic application of two-qubit gates. For all 
topologies, increasing n from 6 to 8 decreases the gap - 
meaning that it takes longer for the algorithm to converge 
to random states. 

If random single-qubit gates are replaced by HZ gates, 
c — 0, the open-chain topology exhibits the largest gap, 
followed by the closed chain. In both cases, the con- 
vergence rate is faster than that of random single-qubit 
gates, and grows when going from 6 to 8 qubits. The HZ 
gates also provide better PR algorithm convergence for 
the star topology, though the rate is smaller for n ~ 8 
than n = 6 in this case. Such a decrease in convergence 
when going from 6 to 8 qubits is more pronounced for 
single-qubit HZ gates than for random ones. Among the 
topologies we have examined, the AA topology is the 
only one which demonstrates better behavior for random 
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FIG. 2: Markov spectral gap, A, versus local gate parame- 
ter, c, for various qubit topologies using two-qubit CZ (main 
panel) and XY gates (inset). Dark lines: n = 6; Light lines: 
n = 8. For CZ gates, the open chain topology (solid lines) 
has a larger gap than any of the other topologies for single- 
qubit HZ gates, c = 0, but the smallest gap for single-qubit 
random gates, c = 1/3 (vertical dotted line). The HZ gate 
is the optimal single-qubit gate for the open chain topology. 
The AA topology (...) has the smallest or second smallest 
gap (depending on the number of qubits) for HZ gates but 
the largest for random gates. Random gates are close to opti- 
mal single-qubit rotations for AA topology. The closed-chain 
topology ( — ■) has the second largest gap for both random ro- 
tations and HZ gates. The optimal single-qubit rotation for 
this topology would be c « .18. The star topology (dashed 
line) has the second smallest gap for both random and HZ 
single-qubit rotations and the optimal single-qubit gate (for 8 
qubits) is also c « .18. For random single-qubit gates, going 
from 6 to 8 qubits decreases the size of the gap for all topolo- 
gies. For single-qubit HZ gates, the gap increases when going 
from 6 to 8 qubits for the open- and closed- chain topologies 
but decreases for the star topology. Inset: A versus c for 
two iterations (such that all couplings have been utilized) of 
PR algorithms using two-qubit XY gates. The closed-chain 
topology ( — ■) has a significantly larger gap than the open 
chain (solid line). The optimal single-qubit gate for 8 qubits 
is the random rotation, c ~ 1/3, for the closed chain, but 
c ~ 1/2 for the open chain. Note the smaller gap (hence 
slower convergence rate) of n = 8 as compared to n = 6. 



single-qubit rotations than for the HZ gates. 

The convergence to random state- vector element distri- 
bution and entanglement for PR circuits using random 
and HZ single-qubit gates is illustrated in Fig. [3] and 
Fig. m respectively. For both quantities in both cases, 
the convergence behavior is in agreement with the pre- 
dictions based on Markov analysis. In particular, one 
sees that when using HZ gates, the AA and star topolo- 
gies become the slowest, and the open- and closed- chain 
topologies have similar convergence rates. Also notice 
that both the open- and closed-chain topologies exhibit 
the entanglement 'cutoff phenomenon' described in [23j . 
For the open (closed) chain, every realization of the PR 



FIG. 3: (Color online) i2-distance between PR distribution of 
squared moduli of components in the computational basis and 
the Porter-Thomas distribution (Inset: Distance of average 
global entanglement from random value) for different qubit 
topologies using CZ two-qubit gates and random single-qubit 
rotations (8 qubits, 100 implementations, all computational 
basis states) . The qubits have been arranged in an open chain 
(0)i a- star formation (*), a closed chain (□), and in such a 
way that all the qubits are connected to each other (0). The 
convergence rate as a function of topology agrees with Markov 
analysis, see Fig. (2] 



algorithm for n/2 {ti/2 — 1) iterations is maximally en- 
tangled. Only after this point does exponential decay to 
random entanglement values set in [3J]. 

Results on the convergence of output state element 
distribution and entanglement for PR circuits employing 
XY gates are given in Fig. [Sj As above, in this case the 
PR circuit implements every other NN coupling for one 
iteration. Similar to CZ gates, the closed-chain topol- 
ogy does not result in convergence to the random state. 
Nevertheless, the convergence rate of the closed chain to 
random entanglement values and element distributions is 
faster than that of the open chain. 

In summary, the optimal single-qubit distribution for 
each of the topologies exploiting two-qubit CZ gates is 
as follows: for the open chain, the HZ gates are opti- 
mal, c = 0. Remarkably, these gates arc naturally moti- 
vated by cluster-state QC [1^. For the AA topology, the 
random single-qubit rotation is optimal, whereas for the 
closed-chain and star topologies the optimal single-qubit 
rotations correspond to c « .18. For chain topologies, 
the optimal c-value is also dependent on the number of 
qubits. If two-qubit XY gates are used, gate distribu- 
tions with c ~ 1/2 and c ~ 1/3 are found to be optimal 
for open and closed chains, respectively. 
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FIG. 4: (Color online) Z2-distance between PR distribution of 
squared moduli of components in the computational basis and 
the Porter-Thomas distribution (Inset: Distance of average 
global entanglement from random value) for different gate 
topologies using CZ gates and single-qubit HZ gates (8 qubits, 
100 implementations, all computational basis states). The 
qubits have been arranged in an open chain (O)) closed chain 
(□), a star formation (*), and in such a way that all the qubits 
are connected to each other (0). The decay rates of the star 
and AA topologies are now slower than that of the open and 
closed chain, as expected from the Markov analysis, see Fig.[2l 



C. Probabilistic gates 

When PR algorithms employ two-qubit CZ gates, com- 
mutation allows any number of such gates to be per- 
formed in parallel. Therefore, until this point, we have 
constructed algorithms that maximize the number of pos- 
sible CZ gates based on the qubit topology per iteration. 
We proceed to explore what happens if CZ gates are ap- 
plied probabilistically, that is, any specific CZ gate is 
applied with probability p. As we shall see, for certain 
topologies lowering p actually increases the convergence 
rate of the PR circuit. 

We begin by considering a probabilistic open chain 
topology. As illustrated in Fig. [6l the Markov spectral 
gap (hence the convergence rate) increases as p is var- 
ied over [0, 1], p = 1 recovering the deterministic case. 
The HZ single-qubit gate turns out to be optimal for al- 
most all p, the improvement in the convergence rate be- 
ing steady until p is close to unity. We expect this since 
HZ gates are the 'maximally non- invariant' single-qubit 
gates with respect to the z-axis, which is preferred by CZ 
gates. As p — s- 1, a rounded hump is visible whose peak 
value for n = 8 is found at c = .02 but whose rounded- 
ness allows for a range of c values giving nearly optimal 
single-qubit gates. As the number of qubits increases, 
the peak of the hump shifts towards c = 0, indicating 
that the non-optimality of the HZ gates is due to the rel- 
atively small Hilbert space dimension. The gap size and 
convergence rate of PR algorithms using HZ gates as a 



FIG. 5: (Color online) i2-distance between PR distribution of 
squared moduli of components in the computational basis and 
the Porter-Thomas distribution (Inset: Distance of average 
global entanglement from random value) for different gate 
topologies using XY two-qubit gates and random single-qubit 
rotations (8 qubits, 100 implementations, all computational 
basis states). The qubits have been arranged in an open chain 
(O) a-nd closed chain (□). As expected, the convergence rate 
of the closed chain is faster than that of the open chain. 




c 



FIG. 6: (Color online) Markov spectral gap. A, versus local 
gate parameter, c, for an 8-qubit open-chain topology for dif- 
ferent probabilities, p, that each CZ gate will be implemented. 
The gap increases with p and the HZ gate (c = 0) is the op- 
timal single-qubit gate for all p until c — .98. Inset: Gap (•) 
and convergence rate ( x ) for HZ gates as a function of p. 



function of p are explicitly shown in the inset of Fig. [S] 

The behavior of a probabilistic closed-chain topology 
is shown in Fig. [T] Initially, as p increases so does the 
gap and the rate of convergence to random. The HZ 
single-qubit gate is optimal until p ~ .85 (for n = 8). As 
p increases further, a hump develops whose maximum 
increases until p ~ .89, and decreases afterwards. The 
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FIG. 7: (Color online) Markov spectral gap, A, versus local 
gate parameter, c, for an 8-qubit closed-chain topology with 
probability p that each CZ gate will be implemented. The 
HZ gate is optimal until p ~ .85. Above that, a hump grows 
towards greater c with increasing p. The height of the hump 
(size of the gap) reaches a maximum at p ~ .89, c ~ 4/90 
(see upper inset). As p continues to grow, the peak of the 
hump moves to higher values of c but now the size of the gap 
shrinks. Lower inset: Gap (•) and convergence rate (x) for 
HZ gates as a function of p. For .9 < p < 1, the decreased gap 
size is due to an eigenvalue representing coupling between the 
two parity subspaces of the system (see text) and thus is not 
reliable to fully determine the actual convergence rate of the 
circuit. 



maximum of the hump shifts towards larger values of c as 
p approaches one, resulting in optimized convergence at 
p ~ .89, c ~ 4/90. Observe that the optimal probability, 
p ~ .89, is close to p = .875 = 7/8: For an 8 qubit 
system, this is the value for which, on average, there is 
one CZ gate per iteration that is not implemented. In 
other words, for this p value the closed chain is similar 
to an open chain with p = 1, except that the opening, 
corresponding to the missed CZ gate, changes position 
randomly at each iteration. This variable opening in the 
qubit chain is what increases the convergence rate to the 
point where the maximum attainable convergence rate 
on an open chain is outperformed by a closed chain with 
p > .7. As the number of qubits increases, the hump is 
pushed to higher values of p, consistent with the fact that 
as n grows, the chances of one two-qubit gate not being 
applied, i.e. that there is a break in the chain, increases 
accordingly. 

As noted above, the closed chain with p — 1 does not 
converge to random due to the conservation of even or 
odd labeled non-identity Pauli operators. However, for 
p < 1 this conservation rule does not hold and ergodic- 
ity is restored. Nevertheless, based on calculations which 
directly compare the state of the system after many iter- 
ations of the Markov matrix to the random state, it ap- 
pears that a transition occurs at p ~ .8, roughly the value 
where the gap begins to decrease. Below this point, the 



FIG. 8: (Color online) Markov spectral gap. A, versus local 
gate parameter, c, for an 8-qubit star topology with proba- 
bility p that a given CZ gate will be implemented. The gap 
increases with p and the HZ gate (c = 0) is the optimal single- 
qubit gate forp < .7. Inset: Gap (■) and convergence rate (x) 
for HZ gates as a function of p. 



mixing between the two parity subspaces is on par with 
the convergence to random within each subspace. For 
p > .8, the relaxation between the subspaces decreases 
~ eventually increasing the time for the PR circuit to 
reach the final random state. This is demonstrated in 
the lower inset of Fig. [71 which shows the gap size and 
convergence rate of PR algorithms using HZ singlc-qubit 
gates within the closed chain topology as a function of 
p. For .9 < p < 1, the gap plummets and then goes back 
up at p 1. This small gap is due to the eigenvalue 
corresponding to the rate of transfer between the two 
subspaces of the system. In terms of entanglement dis- 
tribution, a fast convergence to the value given by Eq. ([5|) 
occurs first, followed by a slower convergence to the ex- 
pected Haar value at a rate determined by the small gap. 

Figure [5] shows the Markov matrix gap for a proba- 
bilistic star topology. For p < .7, the HZ is the optimal 
single qubit gate, while for p ^ -7 a hump forms such 
that the maximum gap is no longer at c = 0. Unlike the 
closed-chain topology, the peak of the hump is now al- 
ways lower than the maximum overall gap, which is found 
at c = 0,p ~ .7 for ti = 8. Note that for .6 < p < .8, 
the convergence rate is faster than that of the open-chain 
topology with p = 1. The star topology has (n — 1) pos- 
sible CZ gates thus p ~ .7 means that there are about 
two inactive couplings per iteration. The curves for the 
various values of p appear to be generally independent of 
the number of qubits. 

Lastly, a probabilistic AA topology is analyzed in Fig. 
O The observed gap values are much larger than any we 
have seen to this point. The optimal single-qubit rotation 
once again depends on p. For p < .5, the optimal single- 
qubit rotation is the HZ gate, whereas for larger p the 
optimal choice of c increases until reaching c ~ 1/3 when 
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FIG. 9: (Color online) Markov spectral gap, A, versus local 
gate parameter, c, for an 8-qubit AA topology with probabil- 
ity p that a given CZ gate will be implemented. The largest 
gap appears as a singularity at p = .5 and c = 0. For p < .5, 
the gap increases with decreasing c, the HZ gate (c = 0) being 
optimal. For p > .5, the largest possible gap decreases and 
the optimal single-qubit gate approaches random, c — 1/3, 
with increasing p. In addition, the size of the gap as a func- 
tion of c exhibits a sharp change of behavior after reaching its 
maximum. Inset: Gap (•) and convergence rate (x) for HZ 
gates as a function of p. The singularity at p = .5 is easily 
seen, as is the difference in the behavior of the gap size as a 
function of p above and below this singularity. 



p = 1. Unlike the other topologies, the gap behavior 
as a function of c for p > .5 exhibits a sharp change 
after reaching a maximum. At p = .5 and c = 0, a 
sharp singularity occurs in both c and p, resulting in an 
extremely large gap, A — .9961. Remarkably, the size 
of this gap approaches one exponentially as the number 
of qubits increases. We will explore this specific case in 
more detail in the next section. 



IV. SCALING BEHAVIOR 

In this section we look at the effect of scaling on PR cir- 
cuits with certain qubit topologies. Specifically, symme- 
tries in the AA and star topologies allow for the compu- 
tation of Markov matrix gaps for PR circuits with a large 
number of qubits. In addition, utilizing the AA topology 
with probabilistically applied two-qubit gates allows us 
to give quantitative predictions on expected scaling be- 
haviors for different classes of qubit topologies based on 
their degree of connectivity (as measured by how many 
qubits are on average connected to each other via CZ 
gates). This results in a simple formula for the scaling of 
the spectral gap for each class, which is consistent with 
the scaling results known to date. 



A. All-to-all topology 

Each time step of a PR circuit using an AA topology 
consists of independent local gates on each qubit, fol- 
lowed by CZ gates between arbitrary pairs of qubits. We 
apply these gates probabilistically, in such a way that for 
each possible pairing of qubits a CZ gate is performed 
independently with probability p. 

In order to access larger numbers of qubits. we con- 
struct a reduced representation by considering equiva- 
lence classes of Pauli operators under permutations of the 
qubits. Thus, we label the states of the reduced Markov 
matrix by the number of Z- and X-Pauli operators they 
contain. As noted previously, although each eigenvalue 
of the reduced chain occurs in the full representation, 
some eigenvalues of the full chain do not occur in the re- 
duced chain, allowing for the possibility that the largest 
eigenvalue could, in principle, be removed. In practice, 
we verified that for up to n = 10 where comparisons with 
the full representation are tractable, the eigenvalue that 
determines the gap occurs in both representations. Fur- 
thermore, the eigenvalue that governs the decay of com- 
putational basis state must occur in the reduced repre- 
sentation, because the second moments of computational 
basis states are invariant under qubit permutations. 

For fixed p, numerical results show that the scaling 
behavior of the gap upon increasing the number of qubits 
differs depending on whether c = or not. For c > 0, the 
gap appears to converge at an exponential rate to 1 — c, 

A(^>")(p;n) = (1-c) -e-''(f)", (6) 

with a convergence rate a{p) which is peaked at p — 0.5. 
When c = 0, however, the gap scales as 

A("-")(p< .5;n) ^ 1- ^ , , , , (7) 

where a and P are p-dependent factors, which are of order 
unity for p < .5. Thus, in this case the gap converges 
exponentially to one. 

Extremely interesting behavior of the gap occurs when 
c = and p = .5. In this case, the eigenvalues of the 
Markov chain are found to exhibit a very simple struc- 
ture: there is the ergodic eigenvector with eigenvalue 1, a 
single eigenvalue of — 2~", multiply degenerate eigenval- 
ues of 2~"/^ and — 2~"/^, and all remaining eigenvalues 
are zero. Thus, the gap is given by 

A(-")(p^.5;n) = l-^. (8) 

This holds for generic initial states. Initial computational 
basis states, however, have a non-zero component only 
along the ergodic eigenvector (with eigenvalue 1) and the 
single eigenvector with eigenvalue 1/2". This leads to a 
gap of 

A(^=°™)(p=-5;n) = l-^, (9) 
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FIG. 10: 1 — A, versus number of qubits for the AA topology 
using probabilistically applied CZ gates. Dark curves: c — .01 
and p = .25(0), -35 (x), and .45 (□). Light curves: c = .1 
using the same values of p. In both cases, the gap approaches 
1 — c for increasing n, consistent with Eq. Upper inset: 
1 — A versus number of qubits for the AA topology when c = 
and p — .2 (dotted line), .4 (dashed line), and .5 (solid line). 
The gap approaches one at an exponential rate for all values 
of p with the quickest convergence at p = .5, as captured by 
Eqs. (0 and ((5)). Lower inset: Gap versus number of qubits 
for deterministic AA topology, p — 1, for single-qubit HZ (Q) 
and random (x) gates. The gap grows with increased n until 
saturating at about .3596 for the HZ gates and .4444 for the 
random gates. 



which is the fastest convergence rate for any PR circuit 
we have examined. The size of the gap as a function of 
the number of qubits for p < .5 is shown in Fig. 1101 

We can use Eq. ([7]) to understand and generalize ex- 
isting results on the gap scaling behavior for PR circuits. 
First, let us review what is known about the scaling be- 
havior of the gap with regard to the number of two-qubit 
gates applied per time step. For a PR circuit in which 
a single two-qubit gate uniformly drawn from the Haar 
distribution on SU(4) is implemented per time step be- 
tween a pair of qubits selected at random, an analytical 
result of 

AW(ri) ^ e(l/n) 

was proved by Harrow and Low in [2^. In addition, 
numerical work by Znidaric (2T| demonstrated the above 
1 /n scaling law to be typical for a large class of two-qubit 
gates used in conjunction with random SU(2) single- 
qubit gates. For PR circuits where ~ n commuting two- 
qubit gates are performed in parallel per time step, nu- 
merical evidence suggests that the gap tends to a con- 
stant value as the number of qubits increases 

A["l(n) = const, 

the value of such a constant depending in general on the 
type of gates and qubit topology. For the AA topology 



under discussion here for fixed p there are ^ n'^ commut- 
ing two-qubit gates performed in parallel per time step. 
We find for the case oip < 1 and c = a scaling behavior 
(as in Eq. ([7])) of the form 



A["'l(n) = 1 - 



1 



K > 0, 



and constant scaling otherwise. 

It is enlightening to compare the convergence rates for 
each of the above cases in terms of the number of two- 
qubit gates applied rather than the number of time steps. 
Let N2 denote the number of two-qubit gates applied per 
time step. In order to compare the gap, A = 1 — Ai, for 
a protocol in which N2 = 1 to one where N2 ^ n (such 
as the open-chain, closed-chain, and star topologies) or 



No 



effective gap, 



(such as in the AA topology), wc postulate an 



AN2 

^off 



1 - A 



[N2] 



corresponding to a Markov matrix obtained by raising 
the original Markov matrix for which there is only one 
two-qubit gate per time step, to the power 7V2, so that 
the same number of two-qubit gates arc considered. Pre- 
suming the gap for a protocol with 7V2 = 1 is typically 
~ it follows that for N2 ~ n, in the limit where as n 
is sufficiently large, 



aS= hm 1-- 



yielding a gap of A|,g = 1 



which is a constant. 



For N2 



this effective gap goes to A 



cff 



1 



which asymptotically approaches 1, consistent with the 
above numerical results. 

We may numerically explore all of the above cases 
within the AA topology by allowing p, the probability 
of performing a two-qubit gate between a given (arbi- 
trary) pair of qubits, to be a function of n. We consider 
the special case of c = 0. First, set 



n{n — 1) ' 



so that on average one CZ gate is performed per time 
step. We find numerically (for the AA configuration) 
that the gap is given asymptotically by A^^l (n) 
consistent with the above mentioned results. 
Next, set 



1.34 
-1.55 ' 



V 



so that an average of n CZ gates are applied per time 
step. In the large-n limit, this results numerically in a 
constant gap of A^"! = .705. Finally, for fixed p < 0.5, 
the gap approaches one as in Eq. ([7]). The above results 
are demonstrated in Fig. [Tl] 

In summary, we find that for a broad class of random 
circuits the gap scales as A = 1 — exp(— q;A^2/''^), ct > 
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FIG. 11: Spectral gap, A, of a PR circuit using an AA topol- 
ogy as a function of the number of qubits, n, with two-qubit 
gates applied with probability p = such that at each iter- 
ation the number of gates performed is on average n. The gap 
asymptotically approaches a constant value of ~ .705 (dashed 
line). The upper inset shows the inverse gap, 1/A, as a func- 
tion of n for p — so that on average one two-qubit 
gate is performed per iteration. The gap scales as oc 1/n. 
The lower inset shows (1 — A) for 50 qubits as a function of p. 
The gap is largest, and hence convergence quickest, at p = .5. 
The approximate formula for the gap when p < .5 is given by 
A = 1 — 1/2"'' and is shown as a dashed line. 

0. Recalling that the convergence rate, F = — ln(l — 
A) = aN2/n, we find that the convergence rate scales 
asymptotically as ^ 1/n per two-qubit gate. This scaling 
behavior appears to be typical for all PR circuits where 
A^2 = 1 to ~ n and for some types of PR circuits with 
when N2 ^ n^ , specifically when p < 1 and c = for the 
AA topology. This is important because if one wishes to 
parallelize by simultaneously performing commuting two- 
qubit gates then it is possible to perform the maximum 
N2 ~ n^ gates without sacrificing the typical 1 /n scaling 
per two-qubit gate of the convergence rate. 



B. Star topology 

For qubits arranged in the star topology, CZ gates 
may be applied only between the central qubit and the 
(?i — 1) outer qubits. With the proper choice of parame- 
ters, this structure also allows (as the AA topology) for 
a simple specification of the Markov matrix eigenvalues. 
Let single-qubit gates be drawn from a distribution with 
c = 1/3 and each CZ gate be applied with a probability 
p = 3/4. Then, the eigenvalues of the Markov matrix 
consist of a (2" — 2)-fold degenerate eigenvalue equal to 
2/3, and a pair of eigenvalues at |(1 ± 2^)- '^^^^ 
maining eigenvalues are all zero, leading to a gap of 



We can also examine a situation where p = 3/4 but 
we draw the single qubit rotation on the central qubit 
from a different distribution than that of the other qubits. 
If, for example, we draw the non-central qubit rotations 
from a random distribution and the central qubit rotation 
from a distribution parameterized by c, the correspond- 
ing Markov matrix develops a (2" — 2)-fold degenerate 
eigenvalue equal to , a pair of eigenvalues at c ± , 
and the remaining eigenvalues are zero. The gap is now 
maximized at .5 when c = for the local gate distribution 
of the central qubit. 

Interestingly, this PR circuit is very similar in structure 
and performance to the approximate 2-design outlined in 
[2^, as discussed in Appendix B. 

V. EFFECT OF COLLECTIVE ROTATIONS 

In all of the above analysis we have assumed PR cir- 
cuits where the single-qubit rotations applied to each 
qubit are drawn independently from some specified dis- 
tribution at each time interval. What would happen if 
we limit the number of times we draw from the given 
distribution? For example, will a PR circuit that applies 
the same random rotation to every qubit (or every other 
qubit) during a given iteration converge more slowly than 
if every qubit rotation is independent? 

Constructing a PR circuit where we decrease the num- 
ber of independently applied single-qubit random rota- 
tions may be viewed as an exercise in generating random 
states (or unitary operators) with as few degrees of free- 
dom as possible. It is well known that quantum chaotic 
systems with only one or two degrees of freedom can 
mimic several properties of random systems [35| . Ex- 
ploring PR circuits by reducing the number of degrees 
of freedom goes one step further, in that it addresses a 
similar question within a general mathematical frame- 
work rather then looking at specific examples of what 
may be extreme cases. This aspect of PR circuits has 
been previously explored in (36| with respect to the uni- 
tary operators applied by the algorithm. Here, we look 
at some further statistics of the generated PR states not 
explored in the previous work and compare them to the 
same characteristics of random states described above. 

Specifically, let us consider an open chain of qubits 
with CZ gates between NN qubits. For the single-qubit 
gates, instead of applying different random or HZ rota- 
tions to every qubit at every PR iteration, we apply the 
same collective random or HZ rotation to all qubits for 
a given iteration. We also look at a case where, instead 
of one single-qubit rotation per iteration, we apply two 
different rotations, the first to odd qubits and the second 
to even qubits. We compare the convergence of the ele- 
ment distribution and entanglement for these two cases 
to that of an open chain where different rotations (ran- 
dom or HZ, respectively) are applied to each qubit. As 
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not the rate of convergence) to randomness may proceed 
more quickly. To demonstrate this, we apply PR circuits 
to initial states of the form: 



10 
Iteration 



FIG. 12: Z2-distance between PR distribution of squared mod- 
uli of components in the computational basis and the Porter- 
Thomas distribution (inset: Distance of average global en- 
tanglement from random value) versus number of iterations 
for open-chain topologies (8 qubits, 100 implementations) in 
which each qubit undergoes equivalent random rotations (O) 
or every other qubit undergoes equivalent random rotations 
(x). Both random single-qubit gates (dark) and HZ gates 
(light) were used. These are compared to circuits in which 
each qubit undergoes a different rotation (□). Interestingly, 
using different rotations on all odd and all even qubits does 
not improve the convergence rate beyond the one achievable 
by using only one collective rotation for all qubits. 



shown in Fig. 1121 limiting the diversity of single-qubit 
rotations appears to have little effect on the convergence 
rate of entanglement. The matrix element distribution, 
however, is strongly affected by the symmetries, and the 
convergence to the FT distribution is significantly slowed 
down by reducing the number of free parameters in the 
PR algorithm. Were a quantum protocol dependent on 
randomness in the state element distribution, these PR 
circuits would require significantly more steps than PR 
circuits with different random rotations on each qubit. 
It is worth noting that for the case in which every qubit 
undergoes the same local gate, the PR gate set is not 
universal over the full Hilbert space because it preserves 
parity with respect to reversing the order of the qubits 
along the chain. Nevertheless, similar to previous cases, 
the asymptotic decay rate is not heavily influenced by 
this fact, although the final asymptotic values of the test 
functions under consideration may differ slightly from 
those expected for the Haar distribution. 



VI. INITIAL STATE DEPENDENCE 

One of the challenges in constructing PR states is cre- 
ating sufficient entanglement using simple one- and two- 
qubit gates. If, however, the initial state already has 
some entanglement the absolute convergence (although 
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where K(a) is the normalization factor. When a — 0, 
the state \ip{0)) is a generalized GHZ state which is max- 
imally entangled relative to arbitrary local observables, 
and K{a) — When a = 1, |^(1)) is completely 

separable, and K{a) = 1/^/N. By applying the PR cir- 
cuit to initial states with different values of a, we can 
explore the number of iterations gained by using an ini- 
tial entangled state. Of course, starting with an initially 
entangled state has a cost. The cost is minimal, however, 
since the above states are easily generated via a series of 
controlled (7a;-rotations by angles which are dependent on 
a. These gates can be implemented simultaneously since 
all gates share the same control qubit. 

Figure [13] shows the difference in state element distri- 
bution and average entanglement between random states 
and states produced from the PR circuit with different 
a initial states as a function of iteration. We have used 
a PR algorithm with an open-chain topology, CZ gates, 
and random single-qubit rotations. All states approach 
the random state entanglement at the same exponential 
rate. Not too surprisingly, however, initial states with 
entanglement closer to that of random states approach 
random state entanglement faster than other states. For 
n = 8, the entanglement of the state a ~ .02337 equals 
that of random states, and has the fastest convergence 
to random entanglement (that the state with entangle- 
ment equal to that of random states converges the fastest 
is more clearly seen for smaller numbers of qubits). As 
the number of qubits grows, the random entanglement 
gets closer to maximum, therefore the GHZ state pro- 
vides optimal convergence. Perhaps more surprisingly, 
the states with more initial entanglement also converge 
to the random state element distribution more quickly 
than their less entangled counterparts. This is in-line 
with the notion that, in general, characteristic entangle- 
ment and state element distribution are correlated [37j . 

Figure [13] allows us to quantify the advantage of start- 
ing with entangled states in terms of the number of iter- 
ations by noting where the different curves cross a given 
distance from random entanglement level. For example, 
the a = state gives an advantage of 4-5 iterations over 
the a ~ 1 state. 

To further test the dependence of a PR algorithm con- 
vergence on the initial state, we have also used as the 
initial state the open-chain cluster state (still utilizing 
a circuit model PR algorithm). We find that the re- 
sults are equivalent to that of the generalized GHZ state. 
Note that with respect to the multi-partite entanglement 
measure we are using, both these states are maximally 
entangled. However, the apparent equivalence between 
the GHZ state and cluster state does not hold for all 
topologies. For example, the convergence of a PR algo- 
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FIG. 13: (Color online) /2-distance between PR distribution of 
squared moduli of components in the computational basis and 
the Porter- Thomas distribution (inset: Distance of average 
global entanglement from random value) versus number of 
iterations of a PR algorithm for 8 qubits (500 samples) and 
initial states a = (O)) — -02337, corresponding to a state 
with random entanglement (A), .1 (x), .2 (□), .3 (+), and 1 
(<i)). All states converge to random at the same rate. Note 
that the a = case has a 4 or 5 iteration advantage over the 
a — 1 initial state. Lower inset: Entanglement of the initial 
states versus a. The random entanglement value is ~ .9883. 

rithm with the qubits in the AA topology does not benefit 
from an initial GHZ state, whereas it does benefit from 
an initial cluster state. 



VII. CONCLUSION 

We have studied the efficiency of PR circuits as a func- 
tion of single-qubit and two-qubit gates, qubit topology, 
the probability of applying the two-qubit gates, and sys- 
tem size, as well as addressed the effect of collective rota- 
tions and different initialization. Using a Markov chain 
analysis, we have analyzed the strong interdependence of 
the choice of single- and two-qubit gates. The optimal 
PR algorithm will consist of a distinct pair of single- and 
two-qubit gates. The optimal choice of gates is also de- 
pendent on the qubit topology. Different topologies will 
favor one or another single- or two-qubits gates. Also 
dependent on the topology, the efficiency of the PR al- 
gorithm may increase if the two-qubit gates are applied 
probabilistically. In particular, we noticed extremely fast 
convergence for the A A topology with p = .5. 

As one of the main implications of our work, we have 
demonstrated how, as the number of qubits is increased, 
the asymptotic convergence rate of second order moments 
of the PR circuit scales as 1/n per two-qubit gate imple- 
mented per time step. This may prove useful to quan- 
titatively characterize PR circuits in terms of t-designs. 
In general, obtaining a deeper understanding of the be- 



havior of PR algorithms as approximate higher-order t- 
designs appears to be an important next step toward har- 
nessing quantum pseudo-randomness. 
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APPENDIX A: CLUSTER-STATE PARAMETERS 

Several potential applications for PR circuits are in 
the area of quantum communications, which is naturally 
suited for photonic implementations. Current analyses 
suggest that linear optics QC will require too many re- 
sources, in terms of amount of optical hardware, for prac- 
tical implementations [11] . The cluster-state approach to 
QC, however, may be less stringent on the number of nec- 
essary elements. 

A cluster state [1^ can be created by first rotating all 
qubits into the state :^(|0) + |1)). Qubits are then en- 
tangled by applying CZ gates between pairs of qubits j 
and k. In a graphical picture of a cluster state, qubits are 
represented by circles and pairs of qubits that have been 
entangled via a CZ gate are connected by a line. A cluster 
state with qubits arranged in a two-dimensional lattice 
such that each qubit has been entangled with four NNs, 
suffices for universal QC [4d|. For photonic realizations of 
cluster-state QC, where two qubit gates can only be ap- 
plied probabilistically, one assumes an unlimited number 
of maximally entangled EPR pairs (attained for example 
via spontaneous parametric down-conversion) which can 
be used to construct larger cluster states using so-called 
fusion operations [4l| . 

Once the desired cluster state has been constructed, 
any QC algorithm can be implemented using only single- 
qubit measurements in the x-y plane. The inherent en- 
tanglement in the cluster state means that measurement 
on one qubit may effect the state of the remaining qubits. 
Thus, one can view each row of the cluster-state lat- 
tice as the evolution of a single-qubit in time. Connec- 
tions between rows are used for two-qubit gates. Pro- 
cessing measurements are performed by column from left 
to right until the last column (left unmeasured), which 
contains the output state of the quantum algorithm, to 
be extracted by a last readout measurement. For a one- 
dimensional cluster chain, the logical operation imple- 
mented by measurement along an angle </> in the x-y 
plane is X(TTm)HZ{(j)), where H is the Hadamard gat e 
and Z{a) {X{a)) is a z- {x-) rotation by an angle a [42l |. 
The dependence of the logical operation on the outcome 
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of the measurement is manifest in to = 0, 1 for measure- 
ment outcome — 1,+1. Any arbitrary rotation can be 
implemented via three logical single-qubit rotations of 
the above sort yielding 



HZ{a + Trma)X{(3 + 'Kmfj)Z{^ + nmy), 



(Al) 



where (a, (3, 7) are the Euler angles of the rotation. By 
drawing the Euler angles according to the Haar measure, 
a random single-qubit rotation can be implemented. Be- 
cause the Haar distribution on SU(2) is invariant un- 
der TT-rotations, the measurement-dependent 7r-rotations 
may be ignored. Two-qubit gates are performed via a 
connection between two rows of the cluster state. CZ 
gates in particular are 'built-in' to the cluster state and 
simple measurement automatically implements the gate. 

PR state generation via the cluster state model of QC 
was first explored in p3l | . The algorithm was initially per- 
formed using three qubits in a chain to implement ran- 
dom rotations, c = 1/3, and a connection between rows 
of the cluster state on every third qubit implemented the 
CZ gates. In this way, £ iterations of an n-qubit PR cir- 
cuit required a lattice of nx 3£+l qubits (where the extra 
1 comes from the final, unmeasured column). However, 
by 'filling in' the extra vertical connections, thus making 
the single-qubit rotations HZ gates with c = 0, such that 
all qubits were connected to their four NNs, the conver- 
gence rate of the PR algorithm increased. 



1. Cluster topologies 
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FIG. 14: (Color online) Z2-distance between PR distribution 
of squared moduli of components in the computational basis 
and the Porter- Thomas distribution (inset: Distance of aver- 
age global entanglement from random value) versus number 
of iterations for different cluster state topologies (8 qubits, 
500 implementations): 2-dimensional lattice (A), cylinder 
(□), star topology measuring the central qubit first (x), star 
topology measuring the central qubit last (0)i ^nd AA (0). 
The cylindrical lattice converges most quickly. The GHZ- 
type topologies (star and AA) converge more slowly and are, 
in this way, like the AA topologies of circuit-model PR algo- 
rithms using two-qubit CZ and one-qubit HZ gates. The open 
and closed chain exhibit cutoff behavior in the entanglement 
convergence, as noted in [23| . 



We explore cluster state topologies where the 'columns' 
of the two-dimensional lattice are connected as circles 
(making the cluster state a cylinder), where all qubits in 
a 'column' are connected to each other (AA) and where 
each qubit is attached only to one central qubit (stars). 
We note that both the AA shape and the star shape are 
graph representations of generalized GHZ states differ- 
ing only by single-qubit rotations [i^. Within the star 
topology, we look at two different measurement strate- 
gies ~ measuring the central qubit first, and measuring 
the central qubit last. 

Figure [T3] shows that the GHZ-type states converge 
more slowly than the two-dimensional lattice and cylin- 
drical topologies with the cylindrical topology converging 
fastest. This is similar to the circuit model for PR algo- 
rithms employing two qubit CZ gates and one qubit HZ 
gates where the AA topology converges the slowest. 



2. Probabilistically applied cluster gates 

Photonic implementations of cluster state QC require 
construction of the appropriate cluster state using op- 
erations that work probabilistically. Thus, any cluster 
state has an associated cost in the amount of resources 
required to, on average, build the given state. Much 



work has gone into finding algorithms that will lower the 
cost of construction for cluster states of two-dimensions, 
i.e. in which each qubit has 4 NNs [ij. In these works, 
cost is generally associated with the number of EPR 
pairs needed for state construction. One conclusion that 
emerges from this work is that the cost of constructing 
one dimensional chains is relatively low and it is joining 
these chains (such that CZ gates can be applied) which is 
the majority of the cost. For this analysis we assume that 
these connections can be attempted without breaking the 
chain (even if the operation is not successful). 

We have already noted that a full two-dimensional clus- 
ter state (with every vertical connection filled in) is opti- 
mal for PR cluster-circuits. However, what if we were to 
apply the cluster CZ gates only probabilistically? While 
we know from our exploration of circuit model PR algo- 
rithms using an open chain topology that the number of 
necessary iterations would increase, a probabilistic appli- 
cation of these gates would cut the construction cost of 
the initial cluster state. 

Let us assume that an attempt to link two cluster 
chains together succeeds with probability p' . We also 
assume the availability of infinitely long cluster chains. 
As an example let p' = 1/2 and we attempt to construct 
a cluster state which can implement C iterations of the 
cluster PR circuit in which the two qubit gates are im- 
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plemented with probability p. On average, construction 
of such a chister state would require 2Cpn probabilistic 
operations, where n is the number of rows (logical qubits) 
in the cluster lattice. Thus, for an 8-qubit cluster PR al- 
gorithm, to construct a cluster capable of implementing 
a PR circuit with a convergence rate of .547 (which is at 
p = .98 as shown in Fig. [S]) requires on average 1.96Cn 
probabilistic operations (recall that the circuit PR algo- 
rithm with HZ gates is equivalent to the cluster-state PR 
algorithm). A PR circuit with half that convergence rate, 
would require 2C iterations but could use an algorithm 
with p ~ .705. This would require on average ~ 2.82Cn 
attempts. Thus, for this case, it is better to construct a 
cluster state capable of higher p value PR circuits. How- 
ever, further analysis is necessary in the more realistic 
case where all resources are accurately counted (includ- 
ing losses taken upon failure of fusion operations), for 
different topologies, and for other values of p and p' . 

APPENDIX B: TWIRLING AS A 
PSEUDO-RANDOM CIRCUIT 

Clifford twirls [1^, or 2-designs, play an important role 
in a number of quantum information protocols. A con- 
struction for an approximate Clifford twirl has recently 
been suggested in [25|. The procedure detailed in [23 
consists of a sequence of twirling operations. A twirl 
consists of i) performing one operation out of a specified 
set of operations; ii) allowing the super-operator. A, to 



occur; iii) undoing the original operation. The net effect 
of the twirl is to convert A into a new super-operator 
A'. In the case of a Clifford twirl, in which the set of 
operations is the Clifford group, any super-operator is 
converted into a depolarizing channel. 

The effect of the sequence of twirling operators on the 
set of Pauli strings can be represented as a Markov ma- 
trix, identical to those that describe the decay of the sec- 
ond moments of a PR circuit. Thus, a one-to one map- 
ping exists between PR circuit constructions and approx- 
imate twirling operations. When the procedure in [25] is 
considered as a PR circuit, it bears a close resemblance 
to PR circuits in a star topology, with probabilistic CZ 
gates, and random local gates applied to the non-central 
qubits. However, the procedure of (25j involves three time 
steps each of which is counted as an application of simul- 
taneous probabilistic CZ gates. An approximate calcula- 
tion of the convergence rate is made in [25l | in order to 
determine scaling properties. We find that an exact cal- 
culation of the gap of the corresponding Markov matrix 
gives Aciifford(c = 1/3) = 5/6, irrespective of the number 
of qubits. We can improve the convergence rate of this 
procedure marginally by using the following, simpler con- 
struction: perform a PR circuit with the star topology 
with c = for the central qubit, while keeping c = 1/3 for 
the non-central qubits and a gate probability of p = 3/4. 
We find a gap of A = 1/2. Since this must be applied 
three times to perform the same number of gates as in 
[25I I , this improved construction yields an effective gap of 

Aciifford(c = 0) = 1 - 1/23 ^ 7/8^ 



[1] I. Bengtsson and K. Zyczkowski, Geometry of Quan- 
tum States: An Introduction to Quantum Entanglement 
(Cambridge University Press, New York, 2006). 

[2] S. Lloyd, Pliys. Rev. A 55, 1613 (1997). 

[3] A. Harrow, P. Hayden, and D. Leung, Phys. Rev. Lett. 
92, 187901 (2004)". 

[4] P. Hayden, D. Leung, P. W. Sfior, and A. Winter, Com- 
mun. Math. Phys. 250, 371 (2004). 

[5] A. Ambainis and A. Sm ith, in: P roc. RANDOM 2004, 
(Cambridge, MA, 2004), |qiiant^/ 0404075 

[6] D. N. Page, Phys. Rev. Lett. 71, 1291 (1993). 

[7] S. K. Foong and S. Kanno, Phys. Rev. Lett. 72, 1148 
(1994). 

[8] A. Scott and C. M. Caves, J. Phys. A 36, 9553 (2003). 
[9] O. Giraud, J. Phys. A 40, 2793 (2007). 
[10] L. Viola and W. G. Brown, J. Phys. A 40, 8109 (2007). 
[11] V. Cappellini, H-.J. Sommers, and K. Zyczkowski, Pliys. 

Rev. A 74, 062322 (2006). 
[12] P. Facchi, G. Florio, and S. Pascazio, Phys. Rev. A 74, 
042331 (2006). 

[13] M. L. Mehta, Random Matrices, (Academic Press, New 
York, 1991). 

[14] C. H. Bennett, P. Hayden, D. Leung, P. W. Shor, and A. 

Winter, IEEE Trans. Inform. Theory 51, 56 (2005). 
[15] J. Emerson, Y. S. Weinstein, M. Saraceno, S. Lloyd, and 

D. G. Cory, Science 302, 2098 (2003). 
[16] B. Levi, C. C. Lopez, J. Emerson, and D. G. Cory, Pliys. 



Rev. A 75, 022314 (2007); J. Emerson et ai, Science 317, 
1893 (2007). 

[17] A. Bendersky, F. Pastawski, and J. P. Paz, Pliys. Rev. 

Lett. 100, 190403 (2008). 
[18] J. Emerson, E. Livine, and S. Lloyd, Phys. Rev. A 72, 

060302(R) (2005). 
[19] Y. S. Weinstein and C. S. Hellberg, Phys. Rev. Lett. 95, 

030501 (2005). 
[20] L. Arnaud and D. Braun, quant-ph/0807 . 0775. 
[21] M. Znidaric, Phys. Rev. A 76, 012318 (2007). 
[22] R. Oliveira, O. C. Dahlsten, and M. B. Plenio, Phys. Rev. 

Lett. 98, 130502 (2007); O. C. O. Dahlsten, R. Oliveira, 

and M. B. Plenio, J. Phys. A 40, 8081 (2007). 
[23] W. G. Brown, Y. S. Weinstein, and L. Viola, Phys. Rev. 

A 77, 040303(R) (2008). 
[24] A. D. K. Plato, O. C. Dahlsten, and M. B. Plenio, 

quant -ph/0806 . 3058. 
[25] C. Dankert, R. Cleve, J. Emerson, E. Livine, 

'quant -ph/0606161 A. Ambainis and J. Emerson, 

quant -ph/0701126 
[26] A. Harrow and R. Low, quant -ph/0802 . 1919. 
[27] H. Barnum, E. Knill, G. Ortiz, and L. Viola, Phys. Rev. 

A 68, 032308 (2003); H. Barnum, E. Knill, G. Ortiz, 

R. Somma, and L. Viola, Phys. Rev. Lett. 92, 107902 

(2004); R. Somma, G. Ortiz, H. Barnum, E. Knill, and 

L. Viola, Phys. Rev. A 70, 042311 (2004). 
[28] H. Barnum, G. Ortiz, R. Somma, and L. Viola, Int. J. 



17 



Theor. Phys. 44, 2127 (2005). 
[29] D. A. Meyer and N. R. Wallach, J. Math. Phys. 43, 4273 
(2002). 

[30] G. K. Brennen, Quant. Inf. Comp. 3, 619 (2003). 

[31] K. Zyczkowski and M. Kus, J. Phys. A 27, 4235 (1994); 
M. Pozniak, K. Zyczkowski, and M. Kus, ibid. 31, 1059 
(1998); F. Haake and K. Zyczkowski, Phys. Rev. A 42, 
R1013 (1990). 

[32] If no restriction is placed on allowable pairs, CZ and XY 
gates yield an equivalent convergence rate when used in 
conjuction with SU(2) single-qubit gates. 

[33] Y. Most, Y. Shimoni, and O. Biham, Phys. Rev. A 76, 
022328 (2007). 

[34] The data is noisy due to the two-fold degeneracy of A2, 
the largest non-trivial eigenvalue of M' . When smoothed, 
the decay is exponential as expected. 

[35] F. Haake, Quantum Signatures of Chaos (Springer, New 
York, 2001). 

[36] Y. S. Weinstein and C. S. Hellberg, Phys. Rev. A 69, 
062301 (2004); Y. S. Weinstein and C. S. Hellberg, ibid. 
71, 014303 (2005). 

[37] Y. S. Weinstein and C. S. Hellberg, Phys. Rev. A 72, 



022331 (2005). 
[38] M. A. Nielsen, Phys. Rev. Lett. 93, 040503 (2004). 
[39] H. J. Briegel and R. Raussendorf, Phys. Rev. Lett. 86, 

910 (2001). 

[40] R. Raussendorf and H. J. Briegel, Phys. Rev. Lett. 86, 
5188 (2001). 

[41] D. E. Browne and T. Rudolph, Phys. Rev. Lett. 95, 
010501 (2005). 

[42] R. Raussendorf, D. E. Browne, and H. J. Briegel, Phys. 

Rev. A 68, 022312 (2003). 
[43] M. Hein, J. Eisert, and H. J. Briegel, Phys. Rev. A 69, 

062311 (2004). 

[44] M. A. Nielsen, Phys. Rev. Lett. 93, 040503 (2004); D. E. 
Browne and T. Rudolph, ibid. 95, 010501 (2005); L. M. 
Duan, R. Raussendorf, ibid. 95, 080503 (2005); Q. Chen, 
J. Cheng, K.L. Wang, J. Du, Phy. Rev. A 73, 012303 
(2006); G. Gilbert, M. Hamrick, and Y. S. Weinstein, 
ibid. 73, 064303 (2006); D. Gross, K. Kieling, and J. 
Eisert, ibid. 74, 042343 (2006). 

[45] D. P. DiVincenzo, D. W. Leung, and B. M. Terhal, IEEE 
Trans. Inf. Theory 48, 580 (2002). 



